-
Notifications
You must be signed in to change notification settings - Fork 920
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Improve distinct
by using cuco::static_map::retrieve_all
#10916
Improve distinct
by using cuco::static_map::retrieve_all
#10916
Conversation
rerun tests |
distinct
: pass stream to map::contains
distinct
by using cuco::static_map::retrieve_all
Seeing performance regression when using
Will look into profiling tomorrow. |
Did you get these numbers by pulling the same |
Codecov Report
@@ Coverage Diff @@
## branch-22.08 #10916 +/- ##
===============================================
Coverage ? 86.32%
===============================================
Files ? 144
Lines ? 22668
Branches ? 0
===============================================
Hits ? 19569
Misses ? 3099
Partials ? 0 Continue to review full report at Codecov.
|
Agreed.
|
Doubtful. I don't see why there would be an increased register usage problem in this situation. The problem from #10511 was that we were using a For |
Updates: the performance regression comes from a bug in the current |
Merging this PR since the performance regression issue will be resolved after merging #10983 |
@gpucibot merge |
Closes #10909
This PR was intended to fix a bug in the
distinct
implementation where the stream parameter was not passed when invokingstatic_map::contains
. During the work, @ttnghia Pointed out that thecontains
+thrust::copy_if
logic can be simplified by usingstatic_map::retrieve_all
. Finally, the PR fetches a newer version ofcuco
to utilizeretrieve_all
and fixes a bug in unit tests where results should be sorted before comparison.